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Abstract. We define representations of continuous functions on infinite streams of dis- 
crete values, both in the case of discrete- valued functions, and in the case of stream- valued 
functions. We define also an operation on the representations of two continuous functions 
between streams that yields a representation of their composite. 

In the case of discrete-valued functions, the representatives are well-founded (finite- 
path) trees of a certain kind. The underlying idea can be traced back to Brouwer's justi- 
fication of bar-induction, or to Kreisel and Troelstra's elimination of choice-sequences. In 
the case of stream-valued functions, the representatives are non-wellfounded trees pieced 
together in a coinductive fashion from well-founded trees. The definition requires an al- 
ternating fixpoint construction of some ubiquity. 



Introduction 

This paper is concerned with the representation and implementation of continuous 
functions on spaces of infinite sequences or streams of discrete values, such as binary digits 
(Cantor space), or natural numbers (Baire space). That is to say, we will look at functions 
of type 

where A is a discrete space, is the space of streams of elements of A with the product 
topology, and X is either a discrete space B, or itself a space of streams B'^. We use the 
symbol =^ for the continuous function space. Functions of this kind and closely related 
kinds arise in many contexts in mathematics and are pervasive in programming, as with 
pipes, stream input-output and coroutines. 

1998 ACM Subject Classification: 68P05, 68N18, 54C35. 

Key words and phrases: Streams, continuous functions, initial algebras, final coalgebras. 
" Hancock's work was supported by EPSERC grant EP/C511964/1. 



I— C LOGICAL METHODS 
I"— ^1 IN COMPUTER SCIENCE 



D0I:1 0.21 68/LMCS-5 (3:9) 2009 



© P. Hancock. D. Pattin son. and N. Ghani 
@ 'Creative Commonsl 



2 



P. HANCOCK, D. PATTINSON, AND N. GHANI 



If one is to implement such a function by means of a program or machine that consumes 
successive values in an input stream, and produces a value (all at once in the discrete 
case, or in a stream of successive values in the stream- valued case), it seems necessary 
that the function be continuous. Otherwise, the whole input stream would be needed at 
once: an output would be forthcoming only 'at the end of time'. Continuity means that 
finite information concerning the output of the function is determined by finite information 
concerning its input. In the simpler, discrete-valued case, this amounts to the requirement 
that the value 6 = / a of the function at argument a is determined (or 'secured') by some 
finite prefix a„ = (ao, ai, 02, . . . , O-n-x) of ct- 

It is fairly clear how to represent continuous functions on with discrete values in B: 
take a well-founded tree branching over with i?'s at the leaves. Such a tree represents a 
continuous function. Start at the root, then use successive entries in the argument stream 
to steer your way along some path to a leaf. When you arrive at the leaf, as you inevitably 
must in view of the tree's being well-founded, there is your value for that argument. We 
can visualise the representation as follows. 




/ : 2^^ ^ 2 

/(o,...) 
/(i,o,...) 

/(1, 1,0,...) 
/(1,1,1,...) 



= 

= 1 

= 

= 1 



At the black inner nodes, the representation 'cats' the next entry in the argument 
stream, and goes left or right according to whether it's or 1; at leaves, it 'spits' the 
(boxed) value for that argument. 

It should be noted that there will be several (actually, infinitely many) representations 
of the same function. For example, the tree below represents the same function as the 
one above, where (of course) two functions are the same if their values are equal for all 
arguments. 




It is perhaps a little less obvious that the representation sketched above is complete: 
any continuous function / : ^4'^ B is representable in this way. The most straightforward 
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argument is irredeemably classical: suppose the function has no representation, and derive 
from this supposition a stream at which it is not continuous. However the completeness 
of the representation can be established constructively, given only the validity of a certain 
principle of called 'bar induction', asserting the equivalence of two notions of barred-ness, 
or covering in Baire space. Here a 'bar' is a monotone subset of A*, the set of finite lists 
of j4's. One notion of barred-ness is weak, having the form of a quantification over infinite 
sequences 

The other notion of barred-ness is strong, being inductively defined, and so having essentially 
the form of a quantification over subsets of A* 

{MU C A*)B U { c : ^* I (Va : A)U{c ^ a)} CU ^U{) 

where () denotes the empty list, and c ^ a the list c with a further entry a at the end. 
Using a variant of this principle, one can show that the two notions of continuity on Baire 
space (one the usual epsilon-delta definition, the other defined inductively) coincide. This 
principle is closely related to Brouwer's 'Bar Theorem', for which he presented a fascinating 
but fallaciou^ argument in three articles. There is an extensive discussion of Brouwer's 
argument by Dummett in [f| pp 68-75], and a more formal analysis of bar induction by 
Howard and Kreisel in [8]. A very penetrating discussion of bar induction that is closely 
related to our representation of continuous functions by well-founded trees is given by Tait 
in [ig. 

The inspiration for our representation of continuous functions with discrete codomain 
was in fact Brouwer's argument for bar-induction, that conjures an inductive structure from 
a proof of a Il\ statement Va3n. . .. This inductive content was made explicit by Kreisel 
and Troelstra j20l 8.4, p225] in the form of the class 'K' of neighbourhood functions central 
to their so-called elimination of choice sequences, discussed in jU pp 75-81] and [H]. In 
this paper, we put this inductive structure into a datatype. In fact the paper of Tait's just 
cited contains (at the bottom of p. 195) a definition of what amounts to the function eat in 
section \2A\ below, differing only in notation. We claim no originality for this insight. 

Now what about stream- valued continuous functions on A'^ with values in B^? The 
idea is again quite simple, though as far as we know, new. It is also difficult to depict. What 
we want is a non-wellfounded tree, branching over A, along every path of which there are 
infinitely many nodes labelled with an element of B. Start at the root, then use successive 
entries in the argument stream to steer your way along some path. Whenever you arrive 
at a node labelled with an element of B, as you will inevitably do infinitely often, emit 
that element as the next entry in the output stream. It turns out to be straightforward 
to express the type of trees we need as a nested fixed point, in which one forms the final 
coalgebra of a functor that is defined using an initial algebra construction. 

Is this representation complete? It turns out that every stream-valued function on 
streams is representable by a non-wellfounded tree of the kind we have described, though 
the argument is perhaps a little intricate. 

Our main contribution, non-trivially extending the state of the art in the 1960's, is to 
formulate a represention of stream processing components (continuous functions between 



Brouwer did not place any restriction such as monotonicity on B. As explained by Dummett [4] pp 
68-75] this is definitely an error. 
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streams, including their composition), different from that customary in the logical litera- 
tur^, that fits better with practical implementation of stream computation. 

The datatype of representations provides a convenient basis for writing stream process- 
ing components in a functional programming language such as Haskell. Nevertheless, the 
coding in Haskell is not entirely satisfying. The chief advantage of using our data type to 
program stream processing components is that it ensures liveness, through the use of mixed 
inductive-coinductive types. The foundations of Haskell are located in a theory of partial 
functions, and not functions in the standard mathematical sense. Totality is something 
extrinsic, beyond the scope of the type system. Our approach guarantees that the stream 
processors are total. So it might be better expressed in a language for total functional 
programming, as advocated by Turner [21], and approximated in systems such as Epigram 
and Agda. This means that evaluation of the constructor form of the value of a function at 
an argument in its domain must terminate, in our opinion something to be striven for in a 
practical programming language. 

It seems that there are lessons to be learnt from this work for the design of formalisms 
and systems for developing dependently typed programs. It is not yet entirely clear what 
facilities for coinductive definition and reasoning such systems need to provide, and in what 
form. It seems firstly that facilities for inductive-recursive definition may be needed in 
connection with coinductive structures: the neighbourhoods in coinductive types have an 
inductive-recursive construction. (Admittedly, this structure does not become fully evident 
until we consider more general coinductive datatypes than streams.) Secondly, inductive 
and coinductive definitions are sometimes nested within each other (as in (ly X) (fiY) B x 
X + y^). Coding our constructions in current systems for dependently typed programming 
has revealed a number of deficiencies and errors in these systems. Dealing with recursive 
definitions in which induction and coinduction are combined needs careful analysis, that 
in our opinion should be based on the universal properties of initial algebras and final 
coalgebras. 

The paper is organised as follows. 

• Section [D preliminaries. 

• Section [2) we define the representation of the continuous function space =^ B hy the 
datatype T^B = {fi X) B + X^ of wellfounded trees branching over A and terminating 
in B, and show it is complete in the sense that each such continuous function has a 
representative (in fact many) in TaB. This part of the paper is in essence fairly well 
known. 

• Section [3) we define the representation of the continuous function space A^ =^ B'^ by 
PaB = (i^Y) Ta{B X Y). The main contribution here is the proof of completeness, which 
is not completely straightforward. The proof is constructive, given completeness in the 
discrete-valued case. 

• Section H) we define two representations of composition, as operators of type PbC x 
PaB — > PaC, and show their correctness. As far as we have been able to discover, this 
representation is new. 

• Section [5j in conclusion, we summarise what has been done, point out related work, and 
indicate some directions for further work. 

^According to this, if fc/ represents a function / : u)"^ B, then (j> : oj" — > is represented by fc/ where 
f{n I a) = <^(q, n). This manoeuver works only when a is a stream of natural numbers, or encodable as such. 
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The main definitions of the paper can be transcribed quite simply into Haskell. A Haskell 
encoding can be found at http : //personal . cis . strath . ac . uk/ -vng/eating . hs. 

1. Preliminaries 

Wc assume the reader is familiar with the categorical notions of product, coproduct, 
and exponential, and standard notations associated with these. We use '-'as infix notation 
for composition, with (as usual) postponent at left and preponent at right. 

1.1. Streams. If yl is a set, we write for the set of countably infinite streams {lo- 
sequences) of elements of A, and A* for the set of finite sequences (lists) of elements of 
A. 

We use Greek letters a, (5, ... as variables over stream types. We overload the infix 
operator (?) (with section notation) our basic means of constructing both streams and non- 
empty lists. Thus if a : ^, then the following functions prefix a to streams and to lists. 

(at) : A'^ ^ A'^ (a^,) : A* ^ A* 

We also have the empty hst o : A*. 

As destructors of streams we use hd and tl. 

hd:A^^A 
tl:A^^ A"" 

For all a : ^ and a : A^ we have 

hd{a°,-) = a : A 

i/(_ia) =a:A'^ 

a = {hd a) ',{tl a) : A^ 

Here we have for clarity written _ for parts of expressions that need not be named. The 
destructors hd and tl are used implicitly in pattern-matching definitions. 
We sometimes write ao for hda^ and a' for tla. 

We use the function () : A^ — > {A*y^ which returns the stream of finite prefixes of its 
argument. It is defined by a(0) = o and a(n -|- 1) = a(0) §(Q!')(n). 

Streams are endowed with a topology in which the neighbourhoods are given by finite 
sequences c : A*. Each such represents the predicate iVc = {a | c = a{lenc) } of streams 
sharing prefix c. We usually suppress the distinction between c : A* and Nc C A^. The 
relation a & N c can be defined by recursion on list c. 

We use =^ for the continuous function space. Thus A^ ^ X consists of the continuous 
functions from A^ to X, where X is either a discrete space D, or a space D'^ where D is 
discrete. 

• A discrete valued continuous function / : A^ ^ D is continuous at a : A^ if there is some 
neighbourhood of a throughout which / is constant. In other words, there exists n € w 
such that / has the same value throughout the neighbourhood an. 

image f {an) = { f a} 

A^ ^ D consists of functions that are continuous throughout A^ . 
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• A stream-valued continuous function / : is continuous at a : if (Vn G 

uj)(3m € Lo)f{a{m)) C / an, or in other words to find out a finite amount of information 
about the value, one need only provide a finite amount of information about the argument. 
A^ consists of functions that are continuous throughout A^ . If m does not depend 

on a, the function is uniformly continuous. Such a function / is contractive if it decreases 
the distance between streams. Prime examples of contractors are the functions (a?) : 
A"^ ^ A"^, indexed by a : ^. 



1.2. Initial algebras and final coalgebras. We use (/xX) F{X) = jiF and {y X) F{X) = 
vF to denote initial and final coalgebras for an endofunctor typically an endofunctor on 
the category of sets. 

Initial algebras In general we use in for the structure map into the carrier of an initial 
algebra. Thus in : F{fiF) — > n{F). Given an algebra C, 7 : FC — > C, we let fold{C;'y), or 
simply fold 7 to denote the unique morphism 5 : ^F ^ C such that 

6 ■ in = ^ ■ F 6. 

We use in~^ for the inverse of the structure map, namely fold{Fin). 

Example: finite sequences A* = {^X) 1 + A x X . We use o and (?) as constructors 
associated with _*, so 

1 2 ^ A* 

(I) 

Ax A* ^ A* 

in = [o|(§)] :l+AxA*^A* 

Example: T^B = (fiX) B + X"^, defined in section [2j The bifunctor TaB is covariant 
in B, and contravariant in A. For fixed A, Ta '■ Set — > Set is actually the free monad over 
the functor (_)"^ (alias {A — >■), known as the reader monad). Intriguingly, our constructions 
all pivot on the freeness of this monad. is also known as the tree monad. We use Ret 
and Get for the constructors associated with T^- Thus 

B — ^TaB 

(TaB)^ TaB 

in = [Ret I Get] : B + (TaB)^ TaB 

Final coalgebras In general we use out for the structure map from the carrier of 
a final coalgebra. Thus out : vF F{vF). Given a coalgebra C,j : C ^ FC, we use 
unfold{C;^), or simply unfold 'y (also called the coiteration of 7) to denote the unique 
coalgebra morphism 5 : C ^ vF such that 

out ■ 6 = F 5 ■ ^ 

We use out~^ for the inverse of the structure map, namely unfold{F out). 

Example: streams A^ . We use hd and tl to access components of a stream, out = 
{hd,tl) : A^ ^ Ax A"^, while out~^{a,a) = a',a. 

Example: PaB = {i'X)Ta{B x X), defined in section [3j 
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2. Discrete codomain 

Recall (from section ll.2p that TaB = {fiX)B + X^. In this section we define a 
function eat of type TaB — > A'^ ^ B x that allows us to represent continuous functions 
in A'^ =^ B using elements of TaB. Then we give a non-constructive argument that this 
representation is complete. 

2.1. Definition of eat. Let MaB = A^ — > (i? x A^) be the state monad, with state set 
A^. (The state is the suffix of the input stream that remains unread.) The unit and bind 
(infix operators of the state monad are as follows. 

r]-. B ^ MaB 

(>:=) : MaB -^{B ^ MaC) MaC 
r]{b) = (A a) {b, a) 

[m /) = (A a) let (i, a') = ma in /(i, a') 

Note that Ma supports the operation of reading one input: 

get : MaA 
get a = (ao, a') 

This function plays an important role below in the guise of {hd, tl) : A^ = ^ x A^ . 
The most straightfoward definition of eat is by structural recursion. 

eat : TaB MaB 

eat (Ret h) = rjh 

eat {Get (p) = get ^5= {eat ■ (p) 

2.2. Completeness. The following result is in essence well known. 

Theorem 2.1. (Completeness of representation of A'^ i? by TaB.) There is a function 
rep : {A'^ ^ B) ^ TaB such that if f : A"^ ^ B then ttq ■ {eat{rep /)) = / 

Note that rep picks a representative for a continuous function from those that give rise 
to extensionally the same function. When A is infinite, there are uncountably many such 
representatives. 

Proof. ( Classical) Suppose that some function / : A^ B has no representative. We 
'construct' an argument a : A^ at which / is not continuous. Thence, if / is continuous at 
all arguments, there exists some r : TaB such that eat{r) equals /. 

Starting with / : A^ — > i?, we 'construct' an infinite sequence of functions without 
representatives. In the first place, for some a : A, the function f -{ai) has no representative. 
(Else / itself would have a representation.) By a form of the axiom of dependent choices, 
if / : A^ — i- B has no representative, then for some a : A^ , none of the functions 

/o = /, /i = /o • ("0 i), /2 = /i • (ai l) = / • («o i) ■ {aii),... 

have representatives. In particular, none of these functions can be constant. It follows that 
/ is not constant in any neighbourhood of a, and so / is not continuous at a. □ 
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The structure of this proof is discussed in Dummett [4, pp 49-55], and Troelstra and 
van Dalen |20l 8.7, p227]. Of course, there are other proofs that do not make use of 
constructively ilhcit forms of contraposition. For example, there seem to be proofs that 
use instead monotone bar-induction, and are arguably intuitionistically valid. However as 
indicated by Tait in |18l pp 194-196], the best we can hope to achieve from a constructive 
point of view is to find models of suitable systems of constructive reasoning to which we 
have adjoined an axiom asserting that a function of type — s- B continuous in the weak 
e—6 sense is always continuous in the strong inductive sense. It is to be expected that such 
a model would refute Church's thesis. 



3. Stream codomain 

Recall (from section [L2]) that PaB = {i'X)Ta{B x X). The previous section gave a 
complete representation of discrete valued continuous functions A'^ ^ B, where A and B 
are discrete. We turn now to stream-valued functions. First we define a function eatoo with 
type PaB A^ =^ B'^. We have not been able to find a similar representation in the 
literature. Then we provide it with a right-inverse rep^. 

Of course, A'^ B^ is isomorphic to {A'^ B)^, and so its elements can be represented 
by streams of representations of A'^ =^ B. However such a representation would be unusable 
in practice, as the same input stream would have to be scanned again and again to produce 
successive items in the output stream. 

3.1. Definition of eatoo- We define eatoo to be the curried form of a function e of type 
(PaB) X A'^ B'^ that is continuous in its second argument. Since B^ is a final coalgebra, 
to define a function into it, it is enough to define a coalgebra for {B x ) with carrier PaB x A^ . 



So 



Then 



PaB X 

out xl 

Ta{B X PaB) X A"^ 

eatxl 

Ma{B X PaB) X 

app 

{B X PaB) X A"^ 



B X {PaB X A'^ 



B^ 



Ixe 



(hd,tl) 



B X B'^ 



e : PaB x A'^ ^ B'^ 

e = unfold{assoc ■ app ■ {{eat ■ out) x 1) 



eatoo 
eatoo 



: PaB A^ 
= curry e . 



B^ 
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A more down-to-earth or humane presentation of the definition of eatoo fohows, as it 
might be written in a functional programming language. 

data Ta b = Ret 6 | Get(a ^ T ah) 

eat :: T ah Str a — >■ (6, Str a) 

eat (Ret h) as = (6, as) 

eat (Get /) (a : as) = eat (/ a) as 

data Pah = F{Ta{b,Pah)) 
eatoo Pab ^ Str a — > Str b 
eatoo (Pi) = let {{h,p), as') = eatt as in 
b : eatooP as' 

Remark: this definition generalises effortlessly to the case when the codomain is an 
arbitrary final coalgebra for a strong functor F (that is, one equipped with a suitable 
natural transformation strength : F X xY ^ F{X x Y)). Let R = v{Ta ■ F). 



outxl 



Ta{FR) X 

app-{eatxl) 

(FR) X A"" 



strength 



F{R X A^) 



F(e') 



uF 



out 



F{vF) 



Though one can thus represent functions from streams into arbitrary final coalgebras, it is 
not clear what a completeness result for this general representation would be. Without some 
serious restriction on the functor F it does not seem possible to conjure up a useful topology 
on the codomain vF. In fact, this is possible for functors that represent a single sorted 
signature of finite arity operators. We hope to substantiate this remark in a subsequent 
publication. 



3.2. Definition of repoo- The function eatoo allows us to interpret an element of the 
datatype PaB as a continuous function in ^ B'^ . Now we define a function rep^ 
that picks a representative for any such continuous function. In the following subsection, 
we'll show that rep oo is right-inverse to eatoo- 

As the codomain of rep oo is to be the carrier of a final coalgebra for the functor (i? x _) , 
we define repoo as the (unique) coalgebra morphism from a coalgebra for the same functor 
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with carrier =^ B'^, namely p ■ t in the fohowing diagram 



PaB 



TaB X {A^ 
p 

Ta{B X (A'^ 
unjold{p-T). Here 



out 



B-)) ^^^'""-H Ta{B X PaB) 



So rep^ 

^ TaB X {A^ ^ B^) 
Tf = {repihd-f),tl-f) 

The other component p of the structure map of our Ta ■ (-Bx)-coalgebra is a fold. (For 
clarity, we give it a more general type than we need.) It is in some sense a 'fast-forward' 
operation. 

p : TaB X {A^^ ^ C) Ta{B y. {A"" ^ C)) 
p(RetbJ) = Ret (6,/) 
p(Get </.,/) = Get {X a) p{cj> a, f ■ (a 
Remarks: p is actually an isomorphism. It does not change the shape of a tree, but only 
decorates the data stored at its leaves. So, for example, (ttq • eat){t,a) = (ttq • assoc ■ 
eat){p{t, /), a) for any t : TaB and / : A'^ ^ C. 

Although rep cannot be defined constructively, at least without postulating some form 
of bar-induction, the construction of rep^ from rep is a simple matter of programming. 



3.3. Completeness of eat^. Now we want to show that the function ea^oo is surjective. 

It is enough to show that rep^ is a right inverse for eatoQ. 

Theorem 3.1. (Completeness of representation of A'^ =^ B^ by PaB.) 

{eatoo ■ rep^) = Ia'^^B'-' ■ 

Proof. We show that the following relation i? is a bisimulation on B'^, and therefore included 
in the equality relation. 

R = {(/ a, eat^irep^ f, a))\a: A^ , f : A^ ^ B^} 

It is enough to prove that if / : A'^ => B^ and a : A'^, then 

(1) hd{f a) = hd[eat(x,{rep^ f ,a)), and 

(2) tl{ f a) Rtl{eatoo{rep^f,a)). 
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As for (1), 

hd{eatca{rep^f,a)) 
= (ttq ■ assoc ■ eat){out{rep^ /), a) 

= (ttq • assoc ■ eat){(TA ■ {Bx))rep^ ■ p ■ {rep x 1) • tl-))f,a) 

= (tto • assoc • eat){{TA ■ {Bx))rep^ ■ p){rep{hd ■ /), tl ■ /), a) 

= {{Ta ■ {Bx))rep^ ■ p doesn't affect shape, or first coordinate of data } 

(ttq • eat){rep{hd ■ f),a) 
= {Completeness in the discrete-valued case } 

hd(f a) 

As for (2), we start by expanding definitions. 

tl{eatooirep^f, a)) 
= (eaioo ■ 7i"i • assoc ■ eat){{TA ■ {Bx))rep^ ■ p){rep{hd ■ J), tl ■ f),a) 

We have to show that that for all f : ^ B"^ smd a : , 

tl{f a) R {eatoo ■ tti ■ assoc ■ eat){{TA ■ {Bx))rep^ ■ p){rep{hd ■ /), tl ■ f),a) . 

By completeness in the discrete- valued case, it is enough to show that for all t G TaB, 

/' : ^ B'^ and a : A'^, 

f'{a) R {eatoo ■ tti ■ assoc ■ eat){{TA ■ {Bx))rep^ ■ p){t,f'),a) . 
We argue by induction on the wellfoundcd structure t. 

• In the base case that t has the form Ret b, calculation shows that 

{eatoo -TTi • assoc ■ eat)({TA ■ {Bx))rep^ ■ p){t,f'),a) 
= eatoo{rep^{f'),a) . 

But (fa) R eatoo{rep^{f'),a)), so we are done with this case. 

• In the step case that t has the form Get (f>, calculation shows that 

(eoioo -VTi • assoc - eat){{TA ■ {Bx))rep^ ■ p){t,f'),a) 
= {eatoo ■ TTi • assoc ■ eat){(TA ■ {Bx))rep^ ■ p){(p{ao),f' ■ (ao',),a') . 

But by induction hypothesis, 
(/'•(«oO)(«') 

R (eaioo ■ tti • assoc ■ eat){{TA ■ {Bx))rep^ ■ p){(j){ao), f ■ {ao ?), , 
and moreover (/' ■ (ao 9)){a') = f'{a). So we are done with this case too. □ 

4. COMPOSITION 

In the previous section we defined a complete representation for continuous functions in 

=^ B'^ , using elements of PaB. As continuous functions are closed under composition, 
if p : PbC represents ^ : B'^ =^ C^, and q : PaB represents ip : A^ ^ B'^ , then there's 
some r : PaC that represents (p • However, the argument for completeness is less than 
entirely constructive. Can we directly program such an r from p and q? Yes! In fact, in 
at least two different ways, one 'lazy', or demand driven, and the other 'greedy', or data 
driven. The computation is reminiscent of cut-elimination in proof theory, though in this 
case the objects that interact with each other are infinite, non-wellfounded trees, rather 
than wellfounded derivation trees. 
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4.1. Definition of composition as an operation on representatives. We define (using 
coiteration) an operation ('(g)') on representations of stream functions that represents tlie 
composition of those functions, in the sense 

eatoo{p (8> g) = eatooP ■ eat ooQ 

for p : PbC and q : PaB. 

First, we define a coalgebra x for the functor • (Cx). The carrier will be the product 
S = Tb{C X PbC) x Ta{B x PaB). First, we present the defining equations for x in 
pattern-matching format, as they might be written in a functional program. (This means 
that in the third equation, tbc must have the form Get (f).) Then we show how to analyse 
this code into nested structural recursions, and so demonstrate that x is not just a piece of 
code, but actually a function defined by universal properties of the functors Tb and Ta- 
postponent preponent 

X -TBiC X PbC) xTa{B x PaB)^ Ta{C x S) 
x(Ret (c,Pbc) ,tab) ='Ret{c,{outpbc,tab)) 
x{Get(j) ,Ket {b,pab)) = x{4>b, out pab) 

x{tbc ,GetV') = Get {X a) x{tbc,ip a) 

Note that in the second equation, x at Get cf) is defined in terms of % at (?!) b, and hence the 
postponent 'goes down' one step in the outer structural recursion (though the preponent 
may 'go up', arbitrarily far). 

It is routine to tease the recursion into the form of nested structural recursions. The 
outer recursion is on the structure of the postponent Tb(C x PbC), with an inner or 
subordinate recursion on the structure of the preponent Ta{B x PaB). To write it down, 
we use a polymorphic function 

fold : (5 ^ C) -> {{A ^ C) ^ C) TaB -> C 

fold p g (Ret b) =pb 

fold pg {Get (I)) =gi{Xa : A) fold p g {(t)a)) 

to express structural recursion over wellfounded trees, or in categorical terms the initiality 
of [Ret I Get] among algebras \p\g\ ■ {B + C^) —>■ C. The definition of x can then be given 
in the form 

xihctab) = foldpgtbctab 

where p : {C x PbC) Ta{B x PaB) Ta{C x S) 

P{c,Pbc)tab = Ret(c, {outpbctab)) 

g:{B^ Ta{B x PaB) ^ Ta{C x S)) ^ Ta{B x PaB) ^ Ta{C x S) 
g f = fold{{X {b,pab)) fb {out pab)) Get . 

Note that the carrier for the algebra of the outer recursion is the function space Ta{B x 
PaB) -^Ta{C X S), while that for the inner recursion is Ta{C x S). 

In this form of composition, priority is given to the postponent's desire to produce 
output. No input is consumed until both the postponent and preponent are reading. 

X gives rise to a composition combinator (g) as follows. First, unfold x'S^ PaC. We 
define (g) by precomposition with this unfold. 

(g) : PbC x PaB PaC 
p®q = {unfold x) {out p, outq) 
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We call (X) lazy composition, since the internal actions of the composite are the minimum 
necessary to respond to demand for data. 

Altenkirch and Swierstra noticed that there is another such coalgebra. We present its 
definition first in functional programming style, using pattern matching; below we show 
how the equations can be teased into nested recursions. 

X' :Tb{C X PbC)xTa{B x PaB)^ Ta{C x S) 
x'ihc ,Get^/;) = Get {X a) x'ihc, 4^ a) 

x'iGetcj) ,'Ret ib,pab)) = x'i4>b, outpab) 

x'(Ret (c,p6c> ,hb) = Ret ic,ioutphc,hb)) 

Because of the top-to-bottom reading of the equations, it is implicit in the last equation that 
tab has the form 'Fiet{b,Pab)- Anthropomorphically, this form of composition gives priority 
to the preponent's 'greedy' desire to read input. Whereas with the 'lazy' form, output 
is produced as soon as the postponent is ready, regardless of the form of the preponent, 
in this greedier form of composition no output is produced until both the postponent and 
preponent are writing. We call the composition combinator (8>' to which x' gives rise greedy 
composition, since the internal actions of the composite are driven by the arrival of data at 
the input. 

What is the mathematical structure of the code for this form of composition? Again, it is 
definition by nested recursion. One might thinlS that the outer recursion is this time on the 
structure of the preponent, and the inner recursion on the postponent. In fact, this would 
not work. In the crucial middle clause (in which the two components communicate), the 
postponent 'goes down' in the structural order, while the preponent may 'go up', arbitrarily 
far. A more careful analysis shows that, again, the outer recursion is on the structure of the 
postponent, with subordinate recursions on the structure of the preponent. In fact there 
is little formal difference from our definition of x above, except that the base case of the 
outer recursion uses another inner recursion rather than a simple explicit definition. The 
local function p then becomes 

p:iCx PbC) ^ Ta{B X PaB) ^ Ta{C x S) 

P ic,Pbc) = fold ((A ib,pab)) Ret(c, iout pbc,'Retib,pab)))) Get 

Unfortunately we currently have little of substance to say about how these forms of 
composition are related. One might well expect that a pipeline implemented with greedy 
composition would be less responsive {i.e. deliver results later) than one expressed with the 
lazy form. 

4.2. Correctness of composition. It remains to prove that the two operations that we 
defined above really represent composition. This pivots on the uniqueness property of 
unfold X- Exploiting the similarity of the definitions for ^ and 0' we can state the following 
basic lemma that applies to both. For the sake of readability, the isomorphism out : PaB = 
Ta{B X PaB) is left implicit. 

Lemma 4.1. Both composition operators © G {0,(8''} satisfy the following laws: 

(1) 'Ret{c,pbc)©tab = 'Ret{c,pbc©tab) (where tab = R-et(6,pafc) in case © = (g)') 

(2) {Get (l))©Ket{b,pab) = (l)b©pab 

(3) pbc©{Get Tp) = Get {Xa. Pbc©ip a) (where pbc = Get cj) in case © = <S) 



did we, at first. 
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Proof. By unfolding the definitions. Actually, it is the desired effect of the definitions that 
we have these properties. □ 

We now set up a bisimulation that shows that tbc ^ Pab and tf,c (81' Pab really represents 
the composite eatooitbc) ■ s.atoo{pab)- Again, the isomorphism PaB = Ta{B x PaB) is left 
implicit. 

Lemma 4.2. 

R = {{eatooiPbc©tab, a), eatoo{Pbc, eatoo{tab, a))) I a G A^} 
is a bisimulation on if © ^ 

Proof. It is enough to prove that 

(1) hd{eatooiPbc©tab,a)) = hd{eatooiPbc, eatoo{tab,a))) 

(2) tl{eatooiPbc®tab,a)),tl{eatoo{pbc,eatoQ{tab,a))) 

for all pbc € PbC and and all tab £ PaB and all a £ A^. The proof relies on the following 
identities, which are readily derived using Lemma |4. II 
Case Pbc = Ret(c, qbc). 

eatoo{Pbc©tab,Oi) = c§ eatoo{qbc©tab,a) 
eatooiPbc, eatoo{tab,a)) = c§ eatoo{qbc, eatoo{tab-,OL)) 
Case Pbc = Get (j) and tab = Ret(6, tab). 

eatoo{pbc©tab,a) = eatoo{{4>b)©tab,a) 
eatooiPbc, eatootaba) = eatooi<pb, eatoo{tab,a))- 

Case tab = Gettp. 

eatooiPbc©tab,a) = eat 00 {pbc©^^ a, a) 

eatooiPbc, eatoo{tab,a°, a)) = eatoo{pbc, eatodi^ a,a)). 

The claim for © = ® now follows by nested structural recursion, the outer induction on 
the postponent, the inner induction on the preponent; for ®' the nesting is reversed. □ 

Corollary 4.3. Both and ®' represent composition, i.e. for all pbc and all tab ^ PaB 

we have 

eatoo{Pbc©tab) = eatoohc ■ eat^oPab- 

for © G {0,®'}. 

Proof. Immediate from the fact that R, defined above, is a bisimulation and the fact that 
all bisimulations on a final coalgebra are contained in the diagonal. □ 

5. Conclusion, related work 

We have defined computationally natural representations of continuous functions on 
streams, and proved completeness of these representations for the classically understood 
notion of continuity. This involved teasing apart the fixed points involved into those that 
are initial and those that are final. We also defined combinators on representations that 
represent the composition of the functions they represent. 

We consider the main point of this paper to be i) a representation of stream processors 
as trees - this ensures that our stream processors are total as opposed to the partial functions 
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which exist in the Haskell function space ; ii) a guarantee that all stream processors 

can be represented by such trees; and iii) a demonstration that these trees are well suited 
to computation — this takes the form of an implementation of the composition of stream 
processing functions directly on the representatives themselves. 

There may also be advantages of a more technical nature. Very often when a function 
is represented by a data structure, such as a wellfounded or infinite tree, the function is 
automatically 'memoised' - its values for particular arguments are recorded in the data, 
and need not be recomputed if they are needed again. For example, the representation of 
functions on finitary inductive types by coinductive trees (in general, final coalgebras for 
certain rank 2 functors) discovered by Hinze [7] and Altenkirch [T] have this property. The 
same phenomenon may occur with our representation of stream functions. However their 
work is concerned with functions on inductive types, as is natural with initial algebras, 
whereas ours is primarily concerned with functions on coinductive types, which is in the 
opposite direction from the universal maps associated with final coalgebras. 

Our representations are not unique, though different representations of the same func- 
tion correspond to computationally different behaviour. Interesting further work might be 
to investigate the equivalence relation between representations corresponding to (exten- 
sional) equality between the represented functions. The relation is clearly not decidable, 
and may be hyperarithmetic or worse (when the data items consumed and produced are 
natural numbers). 

Another question that may deserve further study is to understand and compare the 
relation between the lazy and greedy forms of composition introduced in section HI More 
generally, it may be worth investigating whether there is a real connection between these 
forms of composition and superficially similar forms of composition in cut-elimination, and 
algorithmic game theory. 

The set of streams of values in a set A is perhaps the simplest example of a final 
coalgebra, namely for the functor (xj4), a close relative of the set of natural numbers that 
is an initial algebra for the functor (-1-1). Final coalgebras are sets of 'infinite' values, 
that can model storage, communication and other evolving devices. In other work that we 
hope to publish in due course, we have generalised Brouwer's representations so as to cover 
continuous functions between structures of other coinductive types than streams, that is to 
final coalgebras for a useful class of functors beyond (^x). Broadly the same results can 
be obtained as for the stream case, though the generalisation involves more mathematical 
machinery. The mathematical techniques involve working with indexed families of sets, 
using an inductive-recursive definition (of such an indexed family) in a crucial way. 

It may be possible to extend these techniques yet further to explore representations of 
continuous functions on final coalgebras for finitary indexed containers, that are endofunc- 
tors on slice categories. Some preliminary investigations suggest that this might be rather 
laborious. On the other hand, it could well be worthwhile. Endofunctors of that kind would 
allow us to model non- wellfounded proofs, and so connect our work with Mints' continuous 
cut-elimination [13], analysed by Buchholz in [3]. Another connection that might be made 
is with Brotherston and Simpson's non- wellfounded proof systems in [2\. Yet another is 
with Niwiiiski and Walukiewicz's infinitary proof trees in [15j . 

Stream processing is a very venerable approach to systems design. Streams were used 
in a central way in the 0S6 operating system of Stoy and Strachey |16] . as well as in 
commercial operating systems. The Unix piping system, introduced by Mcllroy, is stream 
based, with buffering handled by the system. In practical programming, a stream facility is 
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often based on something more complicated than a mathematical stream (involving perhaps 
EOF, length, buffering, bounds, putback, ...). These more feature-full streams inhabit 
coinductive types for more elaborate functors than (Ax), but they are not substantially 
different. 

The earliest form of 10 in functional programming languages was stream based (12j : 
a executable program was a (possibly asynchronous) stream processor. Experience quickly 
showed it is easy to make mistakes in programs using asynchronous interfaces. Mature 
implementations of 10 interfaces are therefore based on synchronous processing, consum- 
ing response streams to produce request streams, in a productive or contractive fashion. 
Some early functional operating systems [11] also used streams (sometimes in a ring) for 
communication among system processes. 

The programming system Fudgets |14] is based on a representation of stream processors 
similar to the one in this paper, but without our separation of final from initial fixed 
points. Fudgets are a language for asynchronous stream processing. Various combinators 
are available for building up stream processors. Implementations of Fudgets with Haskell 
have been used to build powerful user interaction (mouse, keyboard, display) interfaces. The 
programming system Yampa [9], which has been used to produce code for robots (among 
other things) uses a synchronous dataflow metaphor, that is well aligned with classical 
control theory, with its signal processors and feedback loops. 

It seems obvious that the semantics of feedback loops involves fixpoints, so it may be 
natural to focus on contractive functions, because of Banach's fixed point theorem (see the 
references in the paper [3j). This states that contractive functions have unique fixed points. 
In their paper "Ensuring streams flow" [19] Turner and Telford have analysed a productivity 
requirement for ensuring unique solutions of recursion equations. Productivity seems to be 
closely related to contractive functions. From another perspective, Buchholz has designed a 
calculus for writing (recursive) stream processing functions, (and even functions processing 
certain not-well-founded trees) which ensures that functions are contractive where required 
[3]. We have not specifically examined the representation of contractive functions, though 
they are prominent in the form of the functions (oq i) in our constructions. Nor have we 
yet considered representations of uniformly continuous functions. 

The notion of arrow, introduced to functional programming by Hughes [10] was devel- 
oped to express compositional infrastructure in programming generalising that of Kleisli 
morphisms for a monad, and crucially interacting with a tensor combinator according to 
some reasonable laws. The reference |6] provides a useful perspective. Abstractly, an arrow 
is a monoid in a certain category of bifunctors. Our stream processors behave quite well 
with respect to composition (•), but it is not clear to us how nicely they play with operators 
such as -|-, X and other multi-input combinators. It may be that one has to get to grips 
with notions of fairness, such as fair merging, in connection with such combinators. An- 
other direction for further development is to investigate combinations of stream processors 
in which, as in many applications of stream processing, there are forms of feedback, or 
looping. 

Acknowledgments. Our colleagues Altenkirch and Swierstra have in unpublished work con- 
sidered the broad topic of modelling impure (effectful) phenomena such as teletype 10 |5], 
mutable heap variables and multithreading. We are grateful to them for interesting conver- 
sations on the topic of stream 10, and in particular for pointing out (and debugging) the 
'greedy' form of composition mentioned in section [H Their model of teletype 10 in [17] . 
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